serial number 07/997,158 filed Dec. 21, 1992, now U.S. patent
which is a continuation of application serial number 07/532,462 filed on June 1, 1990 by Gerardus C.P. Lokhoff for DIGITAL TRANSMISSION SYSTEM, TRANSMITTER AND RECEIVER FOR USE IN THE TRANSMISSION SYSTEM, AND RECORD CARRIER OBTAINED BY MEANS OF THE TRANSMITTER IN THE FORM OF A RECORDING DEVICE, now abandoned.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The invention relates to a transmission system for producing a replica of a wideband digital signal which includes at least a first and a second component; and more particularly to such a system which comprises an encoder including an analyzer for altering the digital signal to obtain a number n of sub-signals for said digital signal; a transmitter for transmitting the sub-signals for reception at a different time or place; a receiver for receiving the sub-signals; and a decoder including a synthesizer for combining the received sub-signals to obtain respective replicas of the digital signal. The invention also relates to an encoding transmitter, and a decoding receiver for such a system.

2. Description of the Prior Art

One transmission system of this type is known from an article, "The Critical Band Coder - Digital Encoding of Speech Signals Based on the Perceptual Requirement of the Auditory System" by M. E. Krasner, published in Proc. IEEE ICASSP 80, Vol. 1, pp. 327-331, April 9-11, 1980. This article relates to a transmission system in which the sub-signals are signals representing frequency bands. The transmitter includes a frequency subband coding system in which the speech signal band is divided into a plurality of subbands whose bandwidths approximate the bandwidths of the critical bands of the human ear in the respective frequency ranges (see Fig. 2 of this article). This division is selected because, based on psycho-acoustic experiments, one can expect that quantization noise in such a subband will be masked to an optimum extent by the signals in that subband, if during quantization allowance is made for the noise-masking curve of the human ear. Threshold values for noise masking by single tones are shown in Fig. 3 of this article. The receiver employs a corresponding subband decoding system.

When applying frequency subband coding to a high-quality digital music signal, such as one according to the Compact Disc Standard which uses 16 bits per signal sample at a sample frequency of 1/T = 44.1 kHz, with a suitably selected bandwidth and a suitably selected quantization for the respective subbands, the quantized output signals of the coder can be represented by an average number of approximately 2.5 bits per signal sample. The quality of the replica of the music signal does not differ perceptibly from that of the original music signal in substantially all passages of substantially all kinds of music signals.

The subbands need not necessarily correspond to the bandwidths of the critical bands of the human ear. For example, the subbands may have equal bandwidths, provided that allowance is made for this in determining the masking threshold.

The invention is also applicable to other types of transmission systems, such as those in which blocks of samples are transform coded. Such systems are referred to in the article "Low bit-rate coding of high-quality audio signals. An introduction to the MASCAM system" by G. Theile, G. Stoll and M. Link, published in EBU Technical Review, no. 230, pp. 71-94, August 1988. In such a system the transform coefficients correspond to the sub-signals.

The sub-signal transmission systems described above have the disadvantage that, in some cases, perceptible differences occur between the replica and the signal which was to be transmitted. These differences are perceived as a form of distortion in the replica generated by the receiver. Often they result from too few bits being available for quantization of certain of the sub-signals.

SUMMARY OF THE INVENTION

An object of the invention is to enable transmission of signals 
representing a wideband digital signal with a significant reduction of the 
distortion present in the replica generated in the receiver. 

Another object of the invention is to identify sub-signals, corresponding to first and second signal components which are related to each other, which can be combined to obtain a composite signal which is transmitted in place of those sub-signals.

Another object of the invention is to transmit digital signals corresponding to stereo audio signals with reduced distortion in the generated replica.

According to the invention, a system as described above further comprises control circuitry for determining and optimizing bit allocation, and a signal combiner for combining selected corresponding sub-signals from the first and second components of the original digital signal to obtain one or more composite sub-signals, and an indicator generator for generating an indicator signal indicating that these corresponding sub-signals are combined. The receiver is responsive to the indicator signal, for generating a signal relating to that composite signal and related to at least one of said first and second components. Preferably, the receiver includes a decoder which synthesizes a signal which is the desired replica, by combining the transmitted sub-signals and composite sub-signals.

The invention is based on recognition that the numbers of bits made available for different sub-signals are not optimally allocated, so that quantization of certain sub-signals is too rough. This leads to audible distortion in a replica resulting from decoding of the received signal. By selectively combining sub-signals which have a correspondence or relationship to each other, and quantizing only one composite sub-signal, so as to make more bits available for quantizing of those sub-signals which are transmitted, the reduced quantizing distortion may more than compensate for the slight loss of information in the replica. This is especially true when the sub-signals which are combined are signals corresponding to a same frequency subband, such as left and right stereo, or other spatial-differentiating signals, in music or audio transmission.

Alternatively, the composite signal may itself be quantized with a greater number of bits than if the two sub-signals were quantized separately.

In a preferred embodiment, a control or central processing unit, and an allocation control unit, together functioning as control or steering circuits, control the signal combiner to combine in each of a number m_1 of said subbands the subband signals of the first and second components in those subbands, to obtain m_1 composite signals in said m_1 subbands, where m_1 is greater than 1. The signal generator generates an indicator signal identifying which subbands had their corresponding sub-signals combined. This indicator signal will function as a steering control signal. The transmitter transmits these composite signals in the m_1 subbands, the indicator signal, and the remaining sub-signals which have not been combined. In this embodiment, the receiver decoder has a deriving circuit for deriving m_1 subband signals from the composite signals in the m_1 subbands, and combining these with the subband signals which were transmitted.

A variation of this embodiment allows a still greater reduction of the data. An allocation control circuit in the transmitter determines the bit availability after the subbands have been processed to form the composite sub-signals. If bit availability is still such that quantization of some subbands will be too rough, then a second evaluation is made with a greater number of subbands being combined. For example, in each of a number m_2 of subbands the subband signals of said signal components are combined to obtain m_2 composite signals in said m_2 subbands. The value m_2 will be greater than m_1, and the m_2 subbands will preferably include all of the m_1 subbands.

In this embodiment the signal generator will then generate a different indicator signal, identifying the m_2 subbands, and the transmitter transmits these composite signals in the m_2 subbands. In the receiver the deriving circuit derives first the m_2 composite signals in the m_2 subbands from the signal received, and then derives from these m_2 composite signals, in response to the indicator signal, subband signals in the m_2 subbands corresponding to said signal portions.

In a typical audio subband division, the m_1 subbands are the m_1 highest subbands; and if further combining is required, then the m_2 subbands are the m_2 highest subbands. This method of combining takes advantage of the fact that the human ear is less phase sensitive in those frequency bands. In one embodiment discussed more fully below, the value m_1 is half of the number M of subbands. For example, if M = 32, the highest (highest frequency) 16 bands may initially be selected for combining. A value of 20 may be used for m_2, and the process can be repeated for m_3 = 24 and m_4 = 28.
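The stepwise widening of the combined range (16, then 20, 24 and 28 of the highest subbands) can be sketched as a simple search. This is only an illustration under stated assumptions: the per-subband bit demands, the bit budget, and the cost model (a composite subband costs the larger of the two channel demands) are hypothetical and are not taken from the specification. The candidate value 0 stands for "no subbands combined".

```python
def choose_combined_count(bits_needed, budget, candidates=(0, 16, 20, 24, 28)):
    """Return the smallest candidate m for which the frame fits the budget.

    bits_needed[ch][k] is a hypothetical per-frame bit demand for channel ch
    (0 or 1) in subband k, with k = 0 the lowest subband. Combining the m
    highest subbands is modelled as sending one composite signal per subband
    that costs the larger of the two channel demands (an assumption).
    Returns None if no candidate fits.
    """
    num_subbands = len(bits_needed[0])
    for m in candidates:
        split = num_subbands - m  # subbands below `split` stay separate
        total = sum(bits_needed[0][:split]) + sum(bits_needed[1][:split])
        total += sum(
            max(a, b)
            for a, b in zip(bits_needed[0][split:], bits_needed[1][split:])
        )
        if total <= budget:
            return m
    return None


# Hypothetical demand: 10 bits per subband and channel, 32 subbands.
demand = [[10] * 32, [10] * 32]
first_fit = choose_combined_count(demand, budget=480)
```

With this toy demand, a budget of 480 bits is first met when the 16 highest subbands are combined; a tighter budget pushes the search on to 20, 24 or 28.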

In yet another preferred embodiment, the transmitter comprises a scale factor determiner, for determining a scale factor for time equivalent signal blocks of the first and second components in the subband signals; and the transmitting section transmits these scale factors. The detector in the receiver is adapted to detect the scale factors which have been transmitted, and to control a multiplier for the subband signals before the full bandwidth signal is reconstructed in a synthesis filter. Correction for any pre-emphasis is made after reconstruction.
BRIEF DESCRIPTION OF THE DRAWING

Fig. 1 is a diagram of a second digital signal generated by a transmitter according to the invention, organized as frames each composed of information packets,
Fig. 2 is a diagram of the structure of a frame according to a preferred embodiment including scale factors,
Fig. 3 is a diagram of the structure of the first portion of the frame of Fig. 2,
Fig. 4 is a block diagram of a system according to the invention,
Fig. 5 is a table showing the number of information packets B in a frame, for certain values of bit rate BR and sample frequency F_s,
Fig. 6 is a table showing the numbers of frames in a padding sequence for different bit rates,
Fig. 7 is a table showing the system information included in the first portion of a frame,
Fig. 8 is a table showing a distribution of information between channels for different modes,
Fig. 9 is a table of meanings of allocation information inserted in the second portion of a frame,
Figs. 10 and 11 are tables showing sequences in which allocation information is stored for two different formats,
Fig. 12 is a block diagram of a receiver according to the invention,
Fig. 13 is a simplified block diagram of a transmitter which records the second digital signal on a magnetic record carrier,
Fig. 14 is a simplified block diagram of a receiver for producing a replica signal from the magnetic record carrier of Fig. 13,
Figs. 15a - 15d are diagrams of different arrangements of scale factors and samples in the third portion of a frame,
Fig. 16 is a block diagram of one preferred transmitter arrangement,
Fig. 17 is a diagram of another structure for the first portion of a frame,
Fig. 18 is a table showing system information included in the structure of Fig. 17,
Fig. 19 is a diagram of a structure for a portion of the structure of Fig. 17,
Fig. 20 is a table showing bit codings in an embodiment of the structure of Fig. 19,
Fig. 21 is a table showing a sequence for allocation information accommodated in a second frame portion associated with the first portion of Fig. 17, for a monaural mode,
Figs. 22a-d are tables showing sequences for allocation information accommodated in a second frame portion associated with the first portion of Fig. 17, for a stereo intensity mode,
Fig. 23 is a diagram of a frame structure including an additional signal,
Fig. 24 is a binary number diagram relating the sample with largest absolute value to an intermediate value used for scale factor computations,
Fig. 25 is a table showing quantization of scaled samples to form q-bit digital representations, and
Fig. 26 is a table showing dequantization of the q-bit digital representations.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

Fig. 1 shows diagrammatically the second digital signal as generated by the transmitter and transmitted via the transmission medium. The second digital signal takes the form of the serial digital data stream. The second digital signal comprises frames, two such frames, i.e. the frame j and the frame j+1, being given in Fig. 1a. The frames, such as the frame j, comprise a plurality of information packets IP1, IP2, IP3, ..., see Fig. 1b. Each information packet, such as IP3, comprises N bits b_0, b_1, b_2, ..., b_(N-1), see Fig. 1c.

Number of Packets

The number of information packets in a frame depends upon
(a) the bit rate BR with which the second digital signal is transmitted through the transmission medium,
(b) the number of bits N in an information packet, N being larger than 1,
(c) the sample frequency F_s of the wide-band digital signal, and
(d) the number of samples n_s of the wide-band digital signal.

The information which corresponds to these packets, and which after conversion in the transmitter belongs to the second digital signal, is included in one frame in the following manner. The parameter P is computed in conformity with the formula

$$P = \frac{BR}{N} \times \frac{n_s}{F_s}$$

If this computation yields an integer for P, the number of information packets B in a frame will be equal to P. If the computation does not result in an integer, some frames will comprise P' information packets and the other frames will comprise P'+1 information packets. P' is the next lower integer below P. The number of frames comprising P' and P'+1 information packets is selected in such a way that the average frame rate is equal to F_s/n_s.
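As a worked check of this formula, the following sketch evaluates P exactly with rational arithmetic. It assumes that 128 kbit/s means 128000 bit/s and uses N = 32 and n_s = 384; the function name is illustrative only.

```python
from fractions import Fraction
from math import floor


def packets_per_frame(bit_rate, n_bits, n_samples, sample_rate):
    """Return (P, P') where P = (BR / N) * (n_s / F_s) as an exact fraction
    and P' is P rounded down to an integer."""
    p = Fraction(bit_rate, n_bits) * Fraction(n_samples, sample_rate)
    return p, floor(p)


# 128 kbit/s at 44.1 kHz: P is fractional, so frames hold P' or P'+1 packets.
p_441, p_floor_441 = packets_per_frame(128_000, 32, 384, 44_100)
# 128 kbit/s at 48 kHz: P is an integer, so every frame holds P packets.
p_48, p_floor_48 = packets_per_frame(128_000, 32, 384, 48_000)
```

For 44.1 kHz the result is P = 5120/147, roughly 34.83, so frames carry 34 or 35 packets; for 48 kHz, P is exactly 32.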

Hereinafter it is assumed that N = 32 and n_s = 384. The table in Fig. 5 gives the number of information packets (slots) in one frame for these values for N and n_s and for four values of the bit rate BR and three values for the sample frequency F_s. It is evident that for a sample frequency F_s equal to 44.1 kHz the parameter P is not an integer in all cases and that consequently a number of frames comprise 34 information packets and the other frames comprise 35 information packets (when BR is 128 kbit/s). This is also illustrated in Fig. 2.

Fig. 2 shows one frame. The frame comprises P' information packets IP1, IP2, ..., IP P'. Sometimes the frame comprises P'+1 information packets. This is achieved by assigning an additional information packet (dummy slot) to the frames of P' information packets. The second column of the table of Fig. 6 gives the number of frames in the padding sequence for a sample frequency of 44.1 kHz and the aforementioned four bit rates. The third column specifies how many of the frames in the sequence comprise P'+1 information packets. Subtracting the number in the third column from the number in the second column yields the number of frames in the sequence comprising P' information packets. The (P'+1)th information packet then need not contain any information, and may then comprise for example only zeroes.
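One way to spread the longer frames over a padding sequence is a running fractional accumulator, sketched below. This is an assumption made for illustration: the specification refers to the table of Fig. 6 for the actual sequences, and that table is not reproduced here. The function name and the accumulator rule are hypothetical.

```python
from fractions import Fraction


def padding_schedule(p):
    """Return the packet count of each frame in one padding sequence.

    p is the exact (fractional) packets-per-frame value P. The sequence
    length equals the denominator of P. An accumulator carries the
    fractional remainder; a frame receives P'+1 packets whenever the
    remainder reaches one whole packet (illustrative rule, not Fig. 6).
    """
    base = p.numerator // p.denominator  # P'
    frac = p - base                      # fractional part of P
    acc = Fraction(0)
    counts = []
    for _ in range(p.denominator):
        acc += frac
        if acc >= 1:
            acc -= 1
            counts.append(base + 1)
        else:
            counts.append(base)
    return counts


# P = 5120/147 corresponds to 128 kbit/s, N = 32, n_s = 384, F_s = 44.1 kHz.
schedule = padding_schedule(Fraction(5120, 147))
```

Because the counts over one sequence sum to the numerator of P, the average number of packets per frame equals P, which keeps the average frame rate at F_s/n_s.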
It is obvious that the bit rate BR is not necessarily limited to the four values as given in the tables of Figs. 5 and 6. Other (for example intermediate) values are also possible.

Fig. 2 shows that a frame comprises three frame portions FD1, FD2 and FD3 in this order. The first frame portion FD1 contains synchronising information and system information. The second frame portion FD2 contains allocation information. The third frame portion FD3 contains samples and, when applicable, scale factors of the second digital signal. For a further explanation it is necessary to first describe the operation of the transmitter in the transmission system in accordance with the invention.

The Transmission System

Fig. 4 shows diagrammatically the transmission system comprising a transmitter 1 having an input terminal 2 for receiving the wide-band digital signal S_BB, which may be for example a digital audio signal. In the case of an audio signal, this may be a mono signal or a stereo signal, in which case the digital signal comprises a first (left channel) and a second (right channel) signal component. It is assumed that the transmitter comprises a coder for subband coding of the wide-band digital signal and that the receiver consequently comprises a subband decoder for recovering the wide-band digital signal.

The transmitter comprises an analysis filter 3 responsive to the digital wide-band signal S_BB to divide the wide band into a plurality M of successive frequency subbands having band numbers m, where 1 ≤ m ≤ M, which increase with frequency. All these subbands may have the same bandwidth but, alternatively, the subbands may have different bandwidths. In that case the subbands may correspond, for example, to the bandwidths of the critical bands of the human ear. The analysis filter generates subband signals S_SB1 to S_SBM for the respective subbands. The transmitter further comprises circuits for sample-frequency reduction and block-by-block quantization of the respective subband signals, shown as the block 9 in Fig. 4.

Such a subband coder is known and is described, for example, in the aforementioned publications by Krasner and by Theile et al. Reference is also made to the published European Patent Application 289,080, to which U.S. patent 4,896,362 corresponds.

For a further description of the operation of the subband coder reference is made to these publications, which are therefore incorporated herein by reference. Such a subband coder enables a significant data reduction to be achieved in the signal which is transmitted to the receiver 5 through the transmission medium 4, for example a reduction from 16 bits per sample for the wide-band digital signal S_BB to 4 bits per sample if n_s is 384. This means that there are blocks of 384 samples of the wide-band digital signal, each sample having a length of 16 bits. If a value M = 32 is assumed, the wide-band digital signal is split into 32 subband signals in the analysis filter means 3. Now 32 (blocks of) subband signals appear on the 32 outputs of the analysis filter means, each block comprising 12 samples (the subbands have equal width) and each sample having a length of 16 bits. This means that on the outputs of the filter means 3 the information content is still equal to the information content of the block of 384 samples of the signal S_BB on the input 2.
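The bit counts in this paragraph can be checked with a few lines of arithmetic. The sketch below only restates the figures given above (384 samples, 16 bits, 32 subbands of 12 samples, and the example reduction to 4 bits per sample); the helper name is illustrative.

```python
def block_bits(samples, bits_per_sample):
    """Number of bits needed for a block of equally sized samples."""
    return samples * bits_per_sample


# One block of the wide-band signal at the input of the analysis filter.
wideband_bits = block_bits(384, 16)
# The same block after splitting into 32 subbands of 12 samples each.
subband_bits = 32 * block_bits(12, 16)
# The block after the example reduction to 4 bits per sample.
reduced_bits = block_bits(384, 4)
```

The first two values are equal (6144 bits), which is the sense in which the filter output carries the same information content as its input; the data reduction comes only from the subsequent quantization.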

The data reduction circuit 9 operates on the output of the filter 3 using the knowledge about masking. At least some of the samples in the 32 blocks of 12 samples, each block for one subband, are quantised more roughly and can thus be represented by a smaller number of bits. In the case of a static bit allocation all the samples per subband per frame are expressed in a fixed number of bits. This number can be different for two or more subbands but it can also be equal for the subbands, for example equal to 4 bits. In the case of dynamic bit allocation the number of bits selected for every subband may differ viewed in time, so that sometimes even a larger data reduction can be achieved, or a higher quality with the same bit rate.

The subband signals quantised in the block 9 are applied to a generator unit 6. Starting from the quantised subband signals this unit 6 generates the second digital signal as illustrated in Figs. 1 and 2. This second digital signal, as stated hereinbefore, can be transmitted directly through the medium 4. Preferably, however, this second digital signal is first adapted in a signal converter (not shown), such as an 8-to-10 converter. Such an 8-to-10 converter is described in, for example, European Patent Application 150,082 to which U.S. patent 4,620,311 corresponds. This converter converts 8-bit data words into 10-bit data words, and enables an interleaving process to be applied. De-interleaving, error correction and 10-to-8 conversion are then performed in the receiver.

Frame Format

The composition and content of the frames will now be explained in more detail. The first frame portion FD1 in Fig. 2 is shown in greater detail in Fig. 3. Fig. 3 shows that the first frame portion comprises exactly 32 bits and is therefore exactly equal to one information packet, namely the first information packet IP1 of the frame. The first 16 bits of the information packet form the synchronising signal (or synchronising word), and may comprise for example only "ones". The bits 16 to 31 are system information. The bits 16 to 23 represent the number of information packets in a frame. This number consequently corresponds to P', both for the frame comprising P' information packets and for frames comprising the additional information packet IP P'+1. P' can be at the most 254 (1111 1110 in bit notation) in order to avoid resemblance to the synchronising signal. The bits 24 to 31 provide frame format information.

Fig. 7 gives an example of the arrangement and significance of this information. Bit 24 indicates the type of frame. In the case of format A the second frame portion has another length (a different number of information packets) than in the case of format B. As will become apparent hereinafter, the second frame portion FD2 in the A format comprises 8 information packets, namely the information packets IP2 to IP9 inclusive; and in the B format it comprises 4 information packets, namely the information packets IP2 to IP5 inclusive. The bits 25 and 26 indicate whether copying of the information is allowed. The bits 27 to 31 indicate the function mode. This means:
a) the channel mode, which indicates the type of wide-band signal (as stated hereinbefore this may be a stereo audio signal, a mono audio signal, or an audio signal comprising two different signal components, for example representing the same text but in two different languages). Fig. 8 represents the channel mode. It illustrates how the signal components are divided between the two channels (channel I and channel II) in the aforementioned cases.
b) the sample frequency F_s of the wide-band signal.
c) the emphasis which may be applied to the wide-band digital signal in the transmitter. The values 50 and 15 µs are the time constants of the emphasis, and CCITT J.17 indicates a specific emphasis standard as defined by the CCITT (Comité Consultatif International Téléphonique et Télégraphique).
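The bit layout of the first frame portion described above (16 sync bits, 8 bits for the packet count, 8 bits of format information) can be sketched as a small parser. One assumption is made that the text does not state: bit 0 is taken to be the most significant bit of the 32-bit word. The field names are illustrative, and the raw field values are returned without interpretation because the coding tables of Figs. 7 and 8 are not reproduced here.

```python
def parse_first_frame_portion(word):
    """Split a 32-bit first frame portion into its raw fields.

    Assumption: bit 0 is the most significant bit of `word`, so bit i
    sits at shift position 31 - i.
    """
    if not 0 <= word < (1 << 32):
        raise ValueError("expected a 32-bit value")
    return {
        "sync": (word >> 16) & 0xFFFF,       # bits 0 to 15
        "packet_count": (word >> 8) & 0xFF,  # bits 16 to 23
        "frame_type": (word >> 7) & 0x1,     # bit 24
        "copy": (word >> 5) & 0x3,           # bits 25 and 26
        "function_mode": word & 0x1F,        # bits 27 to 31
    }


# Example word: all-ones sync, 34 packets, arbitrary format bits.
example = (0xFFFF << 16) | (34 << 8) | (1 << 7) | (2 << 5) | 5
fields = parse_first_frame_portion(example)
```

Since the packet count is at most 254 (1111 1110), the second byte of system information can never be all ones, which is what keeps it from resembling the sync word.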

The content of the frame portion FD2 in Fig. 2 will be described in more detail with reference to Figs. 9, 10 and 11. In the A format the second frame portion contains eight information packets. This is based on the assumptions that the wide-band digital signal S_BB is converted into 32 subband signals (for every signal portion of the digital signal S_BB), and that an allocation word having a length of four bits is assigned to every subband. This yields a total of 64 allocation words having a length of 4 bits each, which can be accommodated exactly in eight information packets. In the B format the second frame portion accommodates the allocation information for only half the number of subbands, so that now the second frame portion comprises only 4 information packets.

Fig. 9 illustrates the significance of the four-bit allocation words AW. An allocation word associated with a specific subband specifies the number of bits by which the samples of the subband signal in the relevant subband are represented after quantisation in the unit 9. For example, the allocation word AW which is 0100 indicates that the samples are represented by 5-bit words. Moreover, it follows from Fig. 9 that the allocation word 0000 indicates that no samples have been generated in the relevant subband. This may happen, for example, if the subband signal in an adjacent subband has such a large amplitude that this signal fully masks the subband signal in the relevant subband. The allocation word 1111 is not used because it closely resembles the sync word in the first information packet IP1.

Fig. 10 indicates the sequence, in the case that the frame mode is A, in which the allocation words AW j,m associated with the two channels j, where j = I or II, and the 32 subbands of the sequence number m, m ranging from 1 to 32, are arranged in the second frame portion. The allocation word AW I,1 belonging to the first subband-signal component of the first and lowest subband (channel I, subband 1) is inserted first. After this the allocation word AW II,1 belonging to the second subband-signal component of the first and lowest subband (channel II, subband 1) is inserted in the second frame portion FD2. Subsequently, the allocation word AW I,2 belonging to the first subband-signal component of the second and lowest but one subband (channel I, subband 2) is inserted in the frame portion FD2. This is followed by the allocation word AW II,2 belonging to the second subband-signal component of the second subband (channel II, subband 2). This sequence continues until the allocation word AW II,4 belonging to the second subband-signal component of the fourth subband (channel II, subband 4) is inserted in the second frame portion FD2. The second information packet IP2 (slot 2) of the frame, which is the first information packet in the frame portion FD2 of the frame, is then filled exactly. Subsequently, the information packet IP3 (slot 3) is filled with AW I,5; AW II,5; ... AW II,8. This continues in the sequence as illustrated in Fig. 10.
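The interleaving just described (channel I then channel II for each subband in ascending order, eight 4-bit words per 32-bit packet) can be generated mechanically. The sketch below is illustrative; the function name and the representation of a word as a (channel, subband) pair are assumptions.

```python
def allocation_order(num_subbands=32, words_per_packet=8):
    """List (channel, subband) pairs packet by packet.

    For each subband in ascending order, channel I comes first and
    channel II second; eight 4-bit allocation words fill one 32-bit
    information packet.
    """
    order = [
        (channel, m)
        for m in range(1, num_subbands + 1)
        for channel in ("I", "II")
    ]
    return [
        order[i:i + words_per_packet]
        for i in range(0, len(order), words_per_packet)
    ]


a_format_packets = allocation_order(32)  # 64 words -> 8 packets (IP2 to IP9)
b_format_packets = allocation_order(16)  # 32 words -> 4 packets (IP2 to IP5)
```

The first packet then holds the words for subbands 1 to 4 of both channels and the second packet starts with AW I,5, matching the description of slots 2 and 3.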

Fig. 10 merely gives the indices j,m of the inserted allocation word AW j,m. Fig. 11 indicates the sequence for the allocation words in the case of a B-format frame. In this case only allocation words of the subbands 1 to 16 are inserted. The sequence, as is illustrated in Fig. 10, corresponds to the sequence in which the separate samples belonging to a channel j and a subband m are applied to the synthesis filter means upon reception in the receiver. This will be explained in greater detail hereinafter.

The serial data stream contains for example only frames in conformity with the A format. In the receiver the allocation information in each frame is then employed for correctly deriving the samples from the information in the third frame portion of said frame. The serial data stream may also comprise, more or less alternately, both frames in conformity with the A format and frames in conformity with the B format. However, the frames in conformity with both formats may contain samples for all channels and all subbands in the third frame portion. A frame in conformity with the B format then lacks in fact the allocation information required to derive the samples for the channels I or II of the subbands 17 to 32 from the third frame portion of a B format frame.

The receiver comprises a memory in which the allocation information included in the second frame portion of an A format frame can be stored. If the next frame is a B format frame only the allocation information for the subbands 1 to 16 and the channels I and II in the memory is replaced by the allocation information included in the second frame portion of the B format frame. The samples for the subbands 17 to 32 from the third frame portion of the B format frame are derived from the allocation information for these subbands derived from the preceding A format frame and still present in the memory. The reason for the alternate use of A format frames and B format frames is that for some subbands the allocation information (in the present case the allocation information for the higher subbands 17 to 32) does not change rapidly. Since during quantization the allocation information for the various subbands is available in the transmitter, this transmitter can decide to generate a B format frame instead of an A format frame if the allocation information for the subbands 17 to 32 inclusive does not change (significantly). Moreover, this illustrates that now additional space becomes available for the inclusion of samples in the third frame portion FD3.
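The receiver-side memory behaviour can be sketched as follows. The class and method names are hypothetical; the only behaviour taken from the text is that an A format frame refreshes all 64 allocation words, while a B format frame refreshes only those for subbands 1 to 16 and leaves the stored words for subbands 17 to 32 untouched.

```python
class AllocationMemory:
    """Holds the most recent allocation word per (channel, subband)."""

    def __init__(self, num_subbands=32):
        self.words = {
            (channel, m): 0
            for m in range(1, num_subbands + 1)
            for channel in ("I", "II")
        }

    def update(self, frame_words):
        """Store allocation words given in transmission order.

        64 words are treated as an A format frame (subbands 1 to 32) and
        32 words as a B format frame (subbands 1 to 16 only).
        """
        if len(frame_words) not in (32, 64):
            raise ValueError("expected 32 (B format) or 64 (A format) words")
        subbands = len(frame_words) // 2
        keys = [
            (channel, m)
            for m in range(1, subbands + 1)
            for channel in ("I", "II")
        ]
        for key, word in zip(keys, frame_words):
            self.words[key] = word


memory = AllocationMemory()
memory.update([3] * 64)  # A format frame: every entry becomes 3
memory.update([5] * 32)  # B format frame: only subbands 1 to 16 become 5
```

After these two updates the entries for subbands 17 to 32 still hold the values from the A format frame, which is what lets the receiver decode the upper subbands of a B format frame.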

For a specific value of P' the third frame portion of a B format frame is four information packets longer than the third frame portion of an A format frame. This enables the number of bits by which the samples in the lower subbands 1 to 16 are represented to be increased, so that for these subbands a higher transmission accuracy can be achieved. Moreover, if it is required to quantize the lower subbands more accurately the transmitter can automatically opt for the generation of B format frames. This may then be at the expense of the accuracy with which the higher subbands are quantized.

The third frame portion FD3 in Fig. 2 contains the samples of the quantised subband-signal components for the two channels. If the allocation word 0000 is not present in the frame portion FD2 for any of the subband channels this means that in the present example twelve samples are inserted in the third frame portion FD3 for each of the 32 subbands and 2 channels. This means that there are 768 samples in total.

28 Scale Factors 

In the transmitter the samples may be multiplied by a scale factor prior to their quantization. For each of the subbands and channels the amplitudes of the twelve samples are divided by the amplitude of the sample which has the largest amplitude among the twelve. In that case a scale factor should be transmitted for every subband and every channel in order to enable the inverse operation to be performed upon the samples at the receiving end. For this purpose the third frame portion then contains scale factors SF j,m, one for each of the quantised subband-signal components in the various subbands.
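The block scaling described above can be sketched as follows. This is only an illustration: the function names are hypothetical, the samples are taken to be floating-point values, and the scale factor is kept as the raw peak amplitude rather than being coded as a 6-bit number.

```python
def scale_block(samples):
    """Divide a block of samples by the largest amplitude in the block.

    Returns the scaled samples and the scale factor (the peak amplitude).
    """
    peak = max(abs(s) for s in samples)
    if peak == 0:
        # An all-zero block needs no scaling.
        return [0.0] * len(samples), 0.0
    return [s / peak for s in samples], peak


def unscale_block(scaled, scale_factor):
    """Inverse operation at the receiving end: multiply by the scale factor."""
    return [s * scale_factor for s in scaled]


# One block of twelve samples for one subband of one channel.
block = [0.5, -2.0, 1.0, 0.25, 0.0, -0.5, 1.5, 2.0, -1.0, 0.75, 0.1, -0.1]
scaled, sf = scale_block(block)
restored = unscale_block(scaled, sf)
```

After scaling, the largest amplitude in the block is 1, so the quantiser can use its full range regardless of the signal level in that subband.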

In the present example, scale factors are represented by 6-bit numbers, the most significant bit first, the values ranging from 000000 to 111110. The scale factors of the subbands to which these are allocated, i.e. whose allocation information is non-zero, are accommodated in the leading part of the frame portion FD3 before the samples. This means that the scale factors are transmitted before the transmission of the samples begins. As a result rapid decoding in the receiver 5 can be achieved without the necessity of storing all the samples in the receiver, as will become apparent hereinafter. A scale factor SF j,m can thus represent the value by which the samples of the signal in the j-th channel of the m-th subband have been multiplied. Conversely, the number one divided by this value may be stored as the scale factor, so that at the receiving end it is not necessary to divide the scale factors before the samples are scaled up to the correct values.

For the frame format A the maximum number of scale factors is 64. If the allocation word AW j,m for a specific channel j and a specific subband m has the value 0000, which means that for this channel and this subband no samples are present in the frame portion FD3, it will not be necessary to include a scale factor for this channel and this subband. The number of scale factors is then smaller than 64. The sequence in which the scale factors SF j,m are inserted in the third frame portion FD3 is the same as that in which the allocation words have been inserted in the second frame portion. The sequence is therefore as follows:






SF I,1; SF II,1; SF I,2; SF II,2; SF I,3; SF II,3; ... SF I,32; SF II,32.

If it is not necessary to insert a scale factor the sequence will not be complete. The sequence may then be, for example:

... SF I,4; SF I,5; SF II,5; SF II,6; ...

In this case the scale factors for the fourth subband of channel II and the sixth subband of channel I are not inserted. If the frame is a B format frame it may still be considered to insert scale factors in the third frame portion for all the subbands and all the channels. However, this is not the only option. In this case it would also be possible to insert scale factors in the third frame portion of the frame for the subbands 1 to 16 only. In the receiver this requires a memory in which all scale factors can be stored at the instant at which a previously arriving A format frame is received. Subsequently, upon reception of the B format frame, only the scale factors for the subbands 1 to 16 are replaced by the scale factors included in the B format frame. The scale factors of the previously received A format frame for the subbands 17 to 32 are then used in order to restore the samples for these subbands included in the third frame portion of the B format frame to the correct scale.
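The ordering rule and the B-format memory update can be illustrated with a short sketch. The dictionary layout and the function names are assumptions made for this example; the text only fixes the order (subband by subband, channel I before channel II) and the rule that entries with allocation word 0000 are skipped.

```python
CHANNELS = ("I", "II")
SUBBANDS = range(1, 33)


def scale_factor_order(allocation):
    """Return the (channel, subband) pairs whose scale factors are sent, in order.

    allocation maps (channel, subband) to a 4-bit allocation word; a value of
    0 (word 0000) means no samples and therefore no scale factor.
    """
    return [(ch, m) for m in SUBBANDS for ch in CHANNELS
            if allocation[(ch, m)] != 0]


def store_scale_factors(memory, received):
    """Overwrite only the received entries; all other entries are retained.

    For a B format frame that carries scale factors for subbands 1 to 16 only,
    the entries for subbands 17 to 32 from the earlier A format frame survive.
    """
    memory.update(received)
    return memory


# The example from the text: SF II,4 and SF I,6 are not inserted.
allocation = {(ch, m): 3 for m in SUBBANDS for ch in CHANNELS}
allocation[("II", 4)] = 0
allocation[("I", 6)] = 0
order = scale_factor_order(allocation)
start = order.index(("I", 4))
print(order[start:start + 4])
```

The printed slice corresponds to the partial sequence SF I,4; SF I,5; SF II,5; SF II,6 given above.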

The samples are inserted in the third frame portion FD3 in the same sequence as the allocation words and the scale factors, one sample for every subband of every channel in succession. According to this sequence, first all the first samples for the quantised subband signals for all the subbands of both channels are inserted, then all the second samples, etc. The binary representation of the samples is arbitrary, the binary word comprising only "ones" again preferably not being used.
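As a sketch of this sample ordering, the following hypothetical helper lists the samples in transmission order for one frame. The tuple layout (sample number, channel, subband) is an assumption chosen for illustration.

```python
def sample_order(allocation, samples_per_block=12):
    """List (sample number, channel, subband) in the order of transmission.

    All first samples are sent for every subband and channel, then all second
    samples, and so on. Entries with allocation word 0 carry no samples.
    """
    order = []
    for n in range(1, samples_per_block + 1):
        for m in range(1, 33):
            for ch in ("I", "II"):
                if allocation[(ch, m)] != 0:
                    order.append((n, ch, m))
    return order


# With no zero allocation words, 12 x 32 x 2 = 768 samples are transmitted.
full = {(ch, m): 4 for m in range(1, 33) for ch in ("I", "II")}
order = sample_order(full)
```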

The second digital signal generated by the transmitter 1 is subsequently applied to the transmission medium 4 by the output 7, and by means of the transmission medium 4 this signal is transferred to the receiver 5. Transmission through the transmission medium 4 may be a wireless transmission, for example over a radio transmission channel. Many other transmission media are also possible. In this respect optical transmission may be envisaged, for example over optical fibres or optical record carriers such as Compact-Disc-like media, or transmission by means of magnetic record carriers utilising RDAT or SDAT-like recording and reproducing technologies, for which reference is made to the book "The Art of Digital Audio" by J. Watkinson, Focal Press, London 1988.

The Receiver

As shown in Fig. 4, the receiver 5 comprises a decoder, which decodes the signal encoded in the coder 6 of the transmitter 1 and converts it into a replica of the wide-band digital signal supplied to the output 8. The essential information in the incoming signal is contained in the scale factors and the samples. The remainder of the information in the second digital signal is merely required for "correct bookkeeping", to allow a correct decoding. The receiver first derives the synchronising and system information from the frames. The decoding process is then repeated for every incoming frame.

Fig. 12 shows a more detailed version of the receiver 5 of Fig. 4. The coded signal (the second digital signal) is applied to a unit 11 through the terminal 10. For every frame, the unit 19 first detects the sync words situated in the first 16 bits of the first frame portion. Since the sync words of successive frames are each time spaced apart by an integral multiple of P' or P' + 1 information packets, the sync words can be detected very accurately. Once the receiver is in synchronism the sync word can be detected in the unit 19. To accomplish this, a time window having, for example, a length of one information packet is opened after each occurrence of P' information packets, so that only that part of the incoming information is applied to the sync word detector in the unit 19. If the sync word is not detected the time window remains open for the duration of another information packet, because the preceding frame may be a frame comprising P' + 1 information packets. From these sync words a PLL in the unit 19 can derive a clock signal to control the central processing unit 18.

It is evident from the above that the receiver should know how many information packets are contained in one frame. For this purpose the switching means 15 are then in the upper position shown, to apply the system information to the processing unit 18. The system information can now be stored in a memory 18a of the processing unit 18. The information relating to the number of information packets in a frame can be applied to the unit 19 over a control-signal line 20, to open the time window at the correct instants for sync-word detection. When the system information has been received the switch 15 is changed over to the lower position. The allocation information in the second frame portion of the frame can now be stored in the memory 18b.
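The windowed sync-word search, which relies on the receiver knowing P', might be sketched as follows. The sketch assumes that the stream is available as a list of 32-bit information packets, that the sync word occupies the 16 most significant bits of the first packet of a frame, and that the receiver is already in synchronism at `first_start`. The sync word value used here is a placeholder, not the value used by the system.

```python
SYNC_WORD = 0xA5A5  # placeholder value, chosen only for this example


def find_frame_starts(packets, p_prime, sync_word=SYNC_WORD, first_start=0):
    """Return the packet indices at which frames start.

    A window of one packet is opened P' packets after the previous sync word.
    If no sync word is found there, the window stays open for one more packet,
    because the preceding frame may have comprised P' + 1 packets.
    """
    def has_sync(i):
        return i < len(packets) and (packets[i] >> 16) == sync_word

    starts = []
    pos = first_start
    while has_sync(pos):
        starts.append(pos)
        if has_sync(pos + p_prime):
            pos += p_prime
        else:
            pos += p_prime + 1
    return starts


# Three frames of 4, 5 and 4 packets (P' = 4); data packets are left at zero.
header = SYNC_WORD << 16
stream = [header, 0, 0, 0] + [header, 0, 0, 0, 0] + [header, 0, 0, 0]
print(find_frame_starts(stream, p_prime=4))
```

Restricting the search to this window means that sample data which happens to resemble the sync word elsewhere in the frame is never presented to the detector.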

If the allocation information in the incoming frame does not comprise an allocation word for all the subbands and channels, this will already have become apparent from the detected system information. This may be, for example, the information indicating whether the frame is an A-format or a B-format frame. Thus, under the influence of the relevant information contained in the system information, the processing unit 18 will store the received allocation words at the correct location in the allocation memory 18b.

It is obvious that in the present example the allocation memory 18b comprises 64 storage positions. If no scale factors are transmitted, the elements bearing the reference numerals 11, 12 and 17 may be dispensed with, and the content of the third frame portion of a frame is applied directly by the connection 16 from the input 10 to a synthesis filter 21. The samples are applied to the filter 21 in the same sequence as the order in which the filter 21 processes the samples in order to reconstruct the wide-band signal. The allocation information stored in the memory 18b is required in order to divide the serial data stream of samples into individual samples in the filter 21, each sample having the correct number of bits. For this purpose the allocation information is applied to the filter 21 over the line 22.

The receiver further comprises a deemphasis unit 23 which subjects the reconstructed digital signal supplied by the filter 21 to deemphasis. For a correct deemphasis the relevant information in the bits 24 to 31 of the first frame portion should be applied from the memory 18a to the deemphasis unit 23 over the line 24.

If the system uses scale factors in this format, the receiver will include the switch 11, the memory 12, and the multiplier 17, and the third frame portion will contain the scale factors SF j,m. Because of a control signal applied by the processing unit 18 over the line 13, the switch 11 is in the lower position at the instant at which the third frame portion FD3 of a frame arrives. Address signals are supplied to the memory 12 by the processing unit 18 over the line 14. The scale factors are then stored in the memory 12, which has 64 locations for the storage of the 64 scale factors. If a B-format frame is being received, the processing unit 18 applies such address signals to the memory 12 that only the scale factors for the subbands 1 to 16 are overwritten by the scale factors in the B-format frame. Subsequently, as a result of another control signal applied over the line 13, the switch 11 is changed to the upper position shown in the drawing, so that the samples are applied to the multiplier 17. Using the allocation information, which is now applied to the multiplier 17 over the line 22, the multiplier first derives the individual samples of the correct bit length from the serial data stream applied over the line 16.
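Deriving individual samples from the serial data stream can be sketched as below. The sketch assumes that the allocation words have already been translated into a bit length per sample, in stream order; that translation is defined elsewhere in the document and is not reproduced here. The stream is modelled as a string of '0' and '1' characters for readability.

```python
def split_samples(bits, bit_lengths):
    """Cut a serial bit string into samples of the given lengths.

    bits: string of '0'/'1' characters, in order of arrival.
    bit_lengths: number of bits of each successive sample; a length of 0
    means that no sample is present and nothing is consumed.
    Returns the sample values as unsigned integers and the bits consumed.
    """
    samples = []
    pos = 0
    for n in bit_lengths:
        if n == 0:
            continue
        samples.append(int(bits[pos:pos + n], 2))
        pos += n
    return samples, pos


# Samples of 3, 4 and 2 bits; the third entry has no samples allocated.
values, used = split_samples("101" + "0011" + "11", [3, 4, 0, 2])
```

Because the allocation information arrives in the second frame portion, before any sample, each sample can be cut from the stream as soon as its bits arrive.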

The samples are then multiplied so as to restore them to the correct values which the original samples had prior to scaling down in the transmitter. If the scale factors stored in the memory 12 are the scale factor values by which the samples have been scaled down in the transmitter, these values should first be inverted (one divided by the value) before application to the multiplier 17. Obviously, it is also possible to invert the scale factors upon reception before they are stored in the memory 12. If the scale factors in the frames are already equal to the value by which the samples should be scaled up during reception, they can be stored directly in the memory 12 and can then be applied directly to the multiplier 17.

It is evident that no memory is required to store all the samples before the signal processing performed upon the samples contained in the frame begins. At the instant at which a sample arrives over the line 16, all the information required for processing this sample is already available, so that processing can be carried out immediately. This entire process is controlled and synchronized by control signals and clock signals applied to all the parts of the receiver by the processing unit 18.

Not all the control signals are shown. This is not necessary because the details of operation of the receiver will be obvious to those skilled in the art. Under control of the processing unit 18 the multiplier 17 multiplies the samples by the appropriate multiplication factors. The samples, which have now been restored to the correct amplitude, are applied to the reconstruction filter 21, in which the subband signals are reconverted to form the wide-band digital signal. Further description of the receiver is not necessary because such receivers are generally known, for example as described in the Thiele et al article cited above. Moreover, it will be evident that if the system information is also transmitted the receiver can be highly flexible and can correctly decode the signals even if the second digital signals contain different system information.

Other embodiments

Fig. 13 shows diagrammatically another embodiment of the transmitter, in the form of a recording device for recording the wide-band digital signal on a record carrier such as a magnetic record carrier 25. The encoder 6 supplies the second digital signal to a recording device 27 comprising a write head 26 by means of which the signal is recorded in a track on the record carrier. It is then possible to record the second digital signal in a single track on the record carrier, for example by means of a helical-scan recorder. In this case the single track can be divided into juxtaposed tracks which are inclined relative to the longitudinal direction of the record carrier. An example of this is an RDAT-like recording method. Another method is to split the information and simultaneously record the split information in a plurality of juxtaposed tracks which extend on the record carrier in the longitudinal direction of the record carrier. For this the use of an SDAT-like recording method may be considered. A comprehensive description of the two above methods can be found in the aforementioned book "The Art of Digital Audio" by J. Watkinson.

Again it is to be noted that the signal supplied by the unit 6 may first be encoded in a signal converter. This encoding may, for example, be an 8-to-10 conversion followed by an interleaving process, as described with reference to Fig. 4. If the encoded information is recorded on the record carrier in a plurality of adjacent parallel tracks, the signal converter should also be capable of assigning the encoded information to the various tracks.

Fig. 14 shows diagrammatically an embodiment of the receiver 5, which may be used in conjunction with the transmitter of Fig. 13; the two may form one apparatus which then provides transmission over time instead of distance. The receiver shown is a player or read device for reading the record carrier 25 on which the wide-band digital signal has been recorded by means of the device shown in Fig. 13, in the form of the second digital signal described above. The second digital signal is read from a track on the record carrier by the read head 29 and is applied to the receiver 5, which may be, for example, of a construction as shown in Fig. 12. Again the read device 28 may be constructed to carry out an RDAT-like or an SDAT-like reproducing method. Both methods are again described comprehensively in the aforementioned book by Watkinson.

If the signal supplied by the unit 6 in the recording device shown in Fig. 13 has been converted, for example in an 8-to-10 conversion and in an interleaving step, the encoded signal read from the record carrier 25 should first be de-interleaved and should be subjected to 10-to-8 conversion. Moreover, if the encoded signal has been recorded in a plurality of parallel tracks, the reproducing unit shown in Fig. 14 should arrange the information read from these tracks in the correct sequence before further processing is applied.

Figs. 15a - d show a number of other possibilities of inserting the scale factors and the samples in the third frame portion FD3 of a frame. Fig. 15a illustrates the above described method in which the scale factors SF for all the subbands m and channels (I or II) are inserted in the third frame portion before the samples. Fig. 15b illustrates the same situation as Fig. 15a, but in this case it diagrammatically represents the storage capacity for the scale factors SF I,m and SF II,m and the associated x samples for these two channels in the subband m. Fig. 15b shows the samples for the two channels in the subband m combined into blocks, whereas normally they are distributed within the third frame portion. The samples have a length of y bits. In the above example x is 12 and y is now taken to be 8.

Stereo coding

Fig. 15c shows another format. The two scale factors for the first and the second channel in the subband are still present in the third frame portion. However, instead of the x samples for both channels (the left and right channels for a stereo signal) in the subband m (i.e. 2x samples in total), only x samples for the subband m are included in the third frame portion. These x samples are obtained, for example, by adding corresponding samples in each of the two channels to one another. Thus a monophonic signal is generated and transmitted for this subband m.

The x samples in Fig. 15c each have a length of z bits. If z is equal to y this saves room in the third frame portion, which can be used for samples requiring a more accurate quantisation. It is alternatively possible to express the x samples of the mono signal in z = 2y (= 16) bits. Such signal processing is applied if the phase difference between the left-hand and the right-hand signal component in a subband is irrelevant but the waveform of the monophonic signal is important. This applies in particular to signals in higher subbands, because the phase-sensitivity of the ear for the frequencies in these subbands is smaller. By expressing the x samples of the mono signal in 16 bits the waveform is quantised more accurately, while the room occupied by these samples in the third frame portion is equal to that in the example illustrated in Fig. 15b.

Yet another possibility is to represent the samples by an intermediate number of bits, for example 12 bits. The signal definition is then more accurate than in the example illustrated in Fig. 15b, while at the same time room is saved in the third frame portion so that the bits saved can be allocated where the need is greater.
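The three options can be compared by counting the bits which the samples of one subband occupy in the third frame portion. The helper below is a hypothetical illustration that uses the example values x = 12 and y = 8 from the text.

```python
def sample_bits(x, bits_per_sample, channels):
    """Bits occupied in the third frame portion by the samples of one subband."""
    return x * bits_per_sample * channels


x, y = 12, 8
stereo_y = sample_bits(x, y, channels=2)        # Fig. 15b: 2x samples of y bits
mono_y = sample_bits(x, y, channels=1)          # Fig. 15c with z = y
mono_2y = sample_bits(x, 2 * y, channels=1)     # Fig. 15c with z = 2y = 16
mono_12 = sample_bits(x, 12, channels=1)        # Fig. 15c with z = 12

print(stereo_y, mono_y, mono_2y, mono_12)  # 192 96 192 144
```

With z = 12 the mono samples are more finely quantised than the 8-bit stereo samples, and 48 bits per subband are still freed for use elsewhere.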

When the signals included in the third frame portion as illustrated in Fig. 15c are reproduced at the receiving end, a stereo effect is obtained which is referred to as "intensity stereo". Here, only the intensities of the left-channel and the right-channel signals (in the subband m) can differ, because of a different value for the scale factors SF I,m and SF II,m. Thus different kinds of information relating to the stereo nature of the audio signal can be represented by the composite signals and other signals which are transmitted.
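A minimal sketch of intensity-stereo reproduction for one subband follows. It assumes that the transmitted mono samples are simply multiplied by the scale factor of each channel. How the transmitter derives the two scale factors is not specified here, and the function name is hypothetical.

```python
def decode_intensity_stereo(mono_samples, sf_left, sf_right):
    """Rebuild both channels of one subband from the shared mono samples.

    Both outputs have the same waveform; only their intensities differ,
    as determined by the two scale factors.
    """
    left = [s * sf_left for s in mono_samples]
    right = [s * sf_right for s in mono_samples]
    return left, right


left, right = decode_intensity_stereo([1.0, -0.5, 0.25], sf_left=2.0, sf_right=0.5)
```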

Fig. 15d shows still another possibility. In this case there is only one scale factor SF m for both signal components in the subband m. This is a situation which is particularly apt to occur in low-frequency subbands.

Yet another possibility, which is not shown, is that the x samples for the channels I and II of the subband m, as in Fig. 15b, do not have associated scale factors SF I,m and SF II,m. Consequently, these scale factors are not inserted in the same third frame portion. In this case the scale factors SF I,m and SF II,m included in the third frame portion of a preceding frame must be used for scaling up the samples in the receiver.

All the possibilities described with reference to Figs. 15a - d can be employed in the transmitter in order to achieve the most efficient data transfer over the transmission medium. Thus, frames as described with reference to different ones of Figs. 15a - d may occur alternately in the data stream. It will be appreciated that, if the receiver is to be capable of correctly decoding these different frames, information about the structure of these frames must be included somewhere, such as in the system information.

The transmitter

Fig. 16 shows the transmitter in more detail, particularly with respect to the combination of the various items of information to form the serial data stream shown in Figs. 1, 2 and 3. Fig. 16 in fact shows a more detailed version of the encoder 6 in the transmitter 1. The encoder has a central processing unit 30, which controls a number of the encoder circuits, and also includes a generator 31 for generating the synchronising information and the system information described with reference to Fig. 3, a generator 32 for supplying allocation information, a generator 33 (optional) for supplying the scale factors, a generator 34 for supplying the samples for a frame, and a generator 35 for generating the additional information packet IP P'+1.

The outputs of these generators are coupled to associated inputs of a multiplexer 40, shown as a five-position switch, whose output is coupled to the output 7 of the encoder 6. The CPU 30 controls the multiplexer or switch 40 over the line 53, and the various generators over the lines 41.1 to 41.4.

The operation of the transmitter will be described for a mono signal divided into M subband signals. These M subband signals S_SB1 to S_SBM are applied to the encoder input terminals 45.1, 45.2, ..., 45.M. If scale factors are to be used, blocks of samples of each of the subband signals are processed together in the optional subband scaling units 46.1 to 46.M. A number, for example twelve, of samples in a block are scaled to the amplitude of the largest sample in the block. The M scale factors are supplied to the unit 33 (if present) over the lines 47.1 to 47.M. The subband signals are supplied both to an allocation control unit 49 and (scaled if that option is in use) to M quantisers 48.1 to 48.M. For every subband the allocation control unit 49 defines the number of bits with which the relevant subband signal should be quantised. This allocation information is applied to the respective quantisers 48.1 to 48.M over the lines 50.1 to 50.M, so that these quantisers correctly quantise the 12 samples of each of the subband signals; it is also supplied to the generator 32. The quantized samples of the subband signals are supplied to the generator 34 over the lines 51.1 to 51.M. The generators 32, 33 and 34 arrange the allocation information, the scale factors and the samples in the correct sequence described above.

In the position of the multiplexer or switch 40 shown, the synchronising and system information associated with the frame to be generated is supplied by the generator 31 in the CPU 30 and fed to the encoder output 7. Subsequently, the multiplexer or switch 40 responds to a control signal supplied by the CPU 30 over the line 53, and is set to the second position from the top so that the output of the generator 32 is coupled to the output 7. The sequence of allocation information as described with reference to Fig. 10 or 11 is then supplied. After this the switch 40 is set to the third position from the top, coupling the output of the generator 33 to the output 7, and the generator 33 now supplies the scale factors in the correct sequence. The switch 40 is then set to the next position, so that the output of the generator 34 is coupled to the output 7, and the generator 34 supplies the samples in the various subbands in the correct sequence. In this cycle exactly one frame is applied to the output 7. Subsequently, the switch 40 is reset to the top position. A new cycle is then started, in which a subsequent block of 12 samples for each subband is encoded and a subsequent frame can be generated on the output 7.
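The switching cycle of the multiplexer 40 amounts to concatenating the generator outputs in a fixed order. The sketch below models each generator output as a list of items; the function and parameter names are hypothetical, and the optional last argument stands for the fifth switch position, which is used only when the additional information packet from the generator 35 is needed.

```python
def assemble_frame(sync_and_system, allocation, scale_factors, samples,
                   additional_packet=None):
    """Concatenate the generator outputs in the order of the switch positions."""
    frame = []
    frame += sync_and_system        # top position: generator 31
    frame += allocation             # second position: generator 32
    frame += scale_factors          # third position: generator 33 (optional)
    frame += samples                # fourth position: generator 34
    if additional_packet is not None:
        frame += additional_packet  # bottom position: generator 35
    return frame


frame = assemble_frame(["SYNC+SYS"], ["AW1", "AW2"], ["SF1", "SF2"],
                       ["S1", "S2", "S3"])
```

Passing an empty list for `scale_factors` models the case in which the optional generator 33 is absent.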

In some cases, for example if the sample frequency F_s is 44.1 kHz (see Fig. 5), an additional information packet (the dummy slot, see Fig. 2) must be added. In that case, after the generator 34 has finished supplying the samples, the multiplexer or switch 40 will be set to the bottom position. The output of the generator 35 is now coupled to the output 7, and the generator 35 generates the additional information packet IP P'+1. After this the switch 40 is reset to the top position to start the next cycle.

It will be clear that, if the signal received by the receiver is to be corrected for errors caused during transmission of the signal, an appropriate error coding and/or interleaving should be applied to the second digital signal. In addition, prior to transmission some modulation is usually required. Thus, the digital signal transmitted through the transmission medium may not be directly identifiable as the second signal, but will be a signal which has been derived therefrom.

It is to be noted that, for example in the case that the subbands have different widths, the number of samples for the various subbands inserted in one third frame portion may differ and is likely to differ. If it is assumed, for example, that a division into three subbands is used, including a lower subband SB1, a central subband SB2 and an upper subband SB3, the upper subband may have a bandwidth which is, for example, twice as large as that of the other two subbands. This means that the number of samples inserted in the third frame portion for the subband SB3 is probably also twice as large as for each of the other subbands. The sequence in which the samples are applied to the reconstruction filter in the receiver may then be: the first sample of SB1, the first sample of SB3, the first sample of SB2, the second sample of SB3, the second sample of SB1, the third sample of SB3, the second sample of SB2, the fourth sample of SB3, etc. The sequence in which the allocation information for these subbands is inserted in the second frame portion will then be: first the allocation word for SB1, then the allocation word for SB3, and subsequently the allocation word for SB2. The same applies to the scale factors. Moreover, the receiver can derive, from the transmitted system information, that in this case the cycle comprises groups of four samples each, each group comprising one sample of SB1, one sample of SB3, one sample of SB2 and subsequently another sample of SB3.
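The four-sample cycle for this three-subband example can be generated with a few lines of code. The subband labels and the function name are chosen for illustration only.

```python
def sample_cycle(groups):
    """List (subband, sample number) in the order described for SB1, SB2, SB3.

    Each group of four comprises one sample of SB1, one of SB3, one of SB2
    and another one of SB3, since SB3 is twice as wide as the other two.
    """
    pattern = ["SB1", "SB3", "SB2", "SB3"]
    counters = {"SB1": 0, "SB2": 0, "SB3": 0}
    order = []
    for _ in range(groups):
        for sb in pattern:
            counters[sb] += 1
            order.append((sb, counters[sb]))
    return order


print(sample_cycle(2))
```

Two groups reproduce the sequence given in the text, from the first sample of SB1 up to the fourth sample of SB3.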

Other Frame Arrangements

Figure 17 shows another structure of the first frame portion FD1. Again the first frame portion FD1 contains exactly 32 bits and therefore corresponds to one information packet. The first 16 bits again constitute the synchronising signal (or synchronisation word). The synchronisation word may also be the same as the synchronisation word of the first frame portion FD1 in Fig. 3, but the information accommodated in bits 16 through 31 differs from the information in bits 16 through 31 in Fig. 3. The bits b16 through b19 represent a 4-bit bit rate index (BR index) number whose meaning is illustrated in the Table in Fig. 18. If the bit rate index is equal to the 4-bit digital number '0000' this denotes the free-format condition, which means that the bit rate is not specified and that the decoder has to depend upon the synchronisation word alone to detect the beginning of a new frame. The 4-bit digital number '1111' is not employed in order not to disturb the synchronisation word detection. In the second column of the table of Fig. 18 the bit rate index is represented as a decimal number corresponding to the 4-bit digital number. The corresponding bit rate values are given in column 1.

With this format the first frame portion contains information related to the number of information packets in the frame. As shown in Fig. 18, the sample frequency F_s is defined by one of the four possible 2-bit digital numbers for the bits b20 and b21 having the values listed. Bit 22 indicates whether the frame comprises a dummy slot, in which case b22 = '1', or does not comprise a dummy slot, in which case b22 = '0'. Along with other predetermined information, then, the information in the bits b16 through b22 makes it possible to determine how many information packets are actually present in the frame.

From the number of samples of the wide-band signal whose corresponding information belonging to the second digital signal is accommodated in one frame, in the present example n_s = 384, it is possible to determine how many information packets B are present in the frame by means of the data in the Table in Fig. 18, the padding bit b22 and the formula

P = \frac{BR}{N} \cdot \frac{n_s}{F_s}

The bit b23 is intended for specifying a future extension of the system. This future extension will be described hereinafter. For the time being this bit is assumed to be '0'.

Indicator Signals

Various indicator and control signals are provided by the bits b24 through b31, which will be described with reference to Figs. 19 and 20. The bits b24 and b25 give the mode indication for the audio signal. For the four possibilities of this two-bit digital number Fig. 20 shows whether the wide-band digital signal is a stereo audio signal ('00'), a mono signal ('11'), a bilingual signal ('10'), or an intensity stereo audio signal ('01'). In the last-mentioned case the bits 26 and 27 indicate which subbands have been processed in accordance with the intensity stereo method. In this example the respective two-bit numbers '00', '01', '10', and '11' mean, respectively, that the subbands 5-32, 9-32, 13-32 and 17-32 have been processed in accordance with the intensity stereo method. As stated hereinbefore, intensity stereo can be applied to the higher subbands because the ear is less phase-sensitive for the frequencies in these subbands. The bit b28 can be used as a copyright bit. If this bit is '1' this means that the information is copy-protected and should not/cannot be copied. The bit b29 can indicate that the information is original information (b29 = '1'), for example in the case of the prerecorded tapes, or information which has been copied (b29 = '0'). The bits b30 and b31 specify the emphasis which may have been applied to the wide-band signal in the transmitter, for example as described with reference to Fig. 7.
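The layout of the bits b16 through b31 can be summarised in a small parser. This is a sketch under stated assumptions: the first frame portion is held in a 32-bit integer with bit 0 (the first bit transmitted) as the most significant bit, and the sync word value in the usage example is a placeholder. The bit rate, sample frequency and emphasis codes are returned as raw numbers, because their tables (Figs. 18 and 19) are not reproduced here.

```python
MODES = {0b00: "stereo", 0b01: "intensity stereo",
         0b10: "bilingual", 0b11: "mono"}
# Subband range processed as intensity stereo, selected by bits 26 and 27.
INTENSITY_SUBBANDS = {0b00: (5, 32), 0b01: (9, 32),
                      0b10: (13, 32), 0b11: (17, 32)}


def field(word, first, last):
    """Extract bits first..last (inclusive) of a 32-bit word, bit 0 = MSB."""
    width = last - first + 1
    return (word >> (31 - last)) & ((1 << width) - 1)


def parse_first_frame_portion(word):
    mode = MODES[field(word, 24, 25)]
    info = {
        "sync": field(word, 0, 15),
        "bit_rate_index": field(word, 16, 19),
        "sample_freq_code": field(word, 20, 21),
        "dummy_slot": field(word, 22, 22) == 1,
        "future_extension": field(word, 23, 23),
        "mode": mode,
        "copyright": field(word, 28, 28) == 1,
        "original": field(word, 29, 29) == 1,
        "emphasis_code": field(word, 30, 31),
    }
    if mode == "intensity stereo":
        info["intensity_subbands"] = INTENSITY_SUBBANDS[field(word, 26, 27)]
    return info


# Placeholder sync word, BR index 5, dummy slot present, intensity stereo
# with switch bits '10', copy-protected, copied material, emphasis code 3.
word = (0xA5A5 << 16) | (5 << 12) | (1 << 10) | (1 << 9) | (1 << 6) | (2 << 4) | (1 << 3) | 3
info = parse_first_frame_portion(word)
```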

Various configurations of the second frame portion FD2 may be described by the various mode indications represented by the bits b24 through b27 in the first frame portion. The second frame portion comprises the 4-bit allocation words whose meaning has been described with reference to Fig. 9. For the stereo mode (b24, b25 = 00) and the bilingual mode (b24, b25 = 10) the second frame portion FD2 again has a length of 8 information packets (slots) and is composed as described with reference to Fig. 10. In the stereo mode 'I' in Fig. 10 then represents, for example, the left-channel component and 'II' the right-channel component. For the bilingual mode 'I' denotes one language and 'II' denotes the other language. For the mono mode (b24, b25 = 11) the length of the second frame portion FD2 is of course only 4 information packets (slots).

9 Fig. 21 illustrates the sequence of the allocation words for the various 
10 subbands 1 through 32 in the four information packets (slots) 2 through 5. 

1 1 Thus, every quantity M-i represents a four-bit allocation word which specifies 

12 the number of bits in every sample in the subband of the sequence number i, i 

13 ranging from 1 to 32. In the intensity stereo mode (t> 2 4» b 2 g = 01) there are 

14 four possibilities indicated by means of the bits t> 2 g and b 2 ^, see Fig. 20. All 

1 5 these possibilities result in a different content of the second frame portion FD2. 
1 6 Figs. 22a to 22d illustrate the four different contents of the second frame 

17 portion. If the switch bits b 26 , b 27 are '00' the signals in the subbands 1 

18 through 4 are normal stereo signals and the signals in the subbands 5 through 
1 9 32 are intensity-stereo signals. This means that for the subbands 1 through 4 

20 for the left-hand and right-hand channel components in these subbands the 

21 associated allocation words should be stored in the second frame portion. In 

22 Fig. 22a this is represented by the consecutive allocation words AW (I, 1); AW 

23 (R, 1 ); AW (I, 2); AW (R, 2); ... AW (R, 4), stored in the slot 2 of the frame, i.e. 

24 the first slot of the second frame portion. Fig. 22a only gives the indices (i-j) of 

25 the allocation words, i being equal to L or R and indicating the left-hand and the 

26 right-hand channel component respectively, and j ranging from 1 through 4 and 

27 representing the sequence number of the subband. For the subbands 5 through 

28 32 the left-hand and the right-hand channel components contain the same series 

29 of samples. The only difference resides in the scale factors for the left-hand and 
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1 the right-hand channel components in a subband. Consequently, such a 

. 2 subband requires only one allocation word. The allocation words AW (i, j) for 

3 these subbands 5 through 32 are indicated by the indices M-j, where i is 

4 consequently equal to M for all the subbands and where j ranges from 5 through 

5 32. 

Fig. 22a shows that 4 1/2 information packets are required for inserting the 36 allocation words in the second frame portion. If the switch bits b26, b27 are '01' the signals in the subbands 1 through 8 will be normal stereo signals and the signals in the subbands 9 through 32 will be intensity-stereo signals. This means that for each of the subbands 1 through 8 two allocation words AW(L, j) and AW(R, j) are required and that for each of the subbands 9 through 32 only one allocation word AW(M, j) is required. This implies that in total 40 allocation words are needed, included in five information packets (slots), i.e. IP2 through IP6, of the frame. This is illustrated in Fig. 22b. In this case the second frame portion FD2 has a length of five information packets (slots).

If the switch bits b26, b27 are '10' the signals in the subbands 1 through 12 will be normal stereo signals and the signals in the subbands 13 through 32 will be intensity-stereo signals. Fig. 22c gives the structure of the second frame portion FD2 with the allocation words for the various subbands. The second frame portion now has a length of 5 1/2 information packets (slots) in order to accommodate all the allocation words. If the switch bits b26, b27 are '11' the signals in the subbands 1 through 16 will be normal stereo signals and the signals in the subbands 17 through 32 will be intensity-stereo signals. Now 48 allocation words are needed, which are inserted in the second frame portion, which then has a length of 6 information packets (slots), see Fig. 22d.
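The lengths of the second frame portion FD2 stated above for the various modes can be checked with a short calculation. The following is a sketch only: the function names are hypothetical, and the slot size of 32 bits is not stated in this passage but is inferred from the statement that 64 four-bit allocation words occupy 8 information packets.

```python
from fractions import Fraction

SUBBANDS = 32
BITS_PER_ALLOCATION_WORD = 4
BITS_PER_SLOT = 32  # assumption: inferred from 64 four-bit words filling 8 slots

# First intensity-stereo subband for each value of the switch bits b26, b27.
FIRST_INTENSITY_SUBBAND = {0b00: 5, 0b01: 9, 0b10: 13, 0b11: 17}


def allocation_word_count(mode, switch_bits=None):
    """Number of 4-bit allocation words in FD2 for the given mode."""
    if mode in ("stereo", "bilingual"):
        return 2 * SUBBANDS            # one word per subband per channel
    if mode == "mono":
        return SUBBANDS                # one word per subband
    if mode == "intensity stereo":
        stereo_bands = FIRST_INTENSITY_SUBBAND[switch_bits] - 1
        joint_bands = SUBBANDS - stereo_bands
        return 2 * stereo_bands + joint_bands  # L and R words, then one M word each
    raise ValueError(f"unknown mode: {mode!r}")


def fd2_length_in_slots(mode, switch_bits=None):
    """Length of FD2 in information packets (slots), as an exact fraction."""
    bits = allocation_word_count(mode, switch_bits) * BITS_PER_ALLOCATION_WORD
    return Fraction(bits, BITS_PER_SLOT)
```

Under these assumptions the calculation gives 36, 40, 44 and 48 allocation words for the switch bits '00', '01', '10' and '11', corresponding to 4 1/2, 5, 5 1/2 and 6 slots respectively, in agreement with Figs. 22a to 22d as described.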

What has been stated above about the scale factors is also valid here. When it is assumed that the allocation word 0000 has been assigned neither to any of the subbands nor to any of the channels, 64 scale factors are required both for the stereo mode and for the intensity-stereo modes. This is because in all the intensity-stereo modes every mono subband should have two scale factors to enable intensity-stereo to be realised for the left-hand and the right-hand channel in this subband (see Fig. 15c). It is obvious that in the mono mode the number of scale factors is halved, i.e. 32, again assuming that the allocation word 0000 has not been assigned to any of the subbands.

Scale Factor Determination

A method of determining the 6-bit scale factors will now be explained. As stated hereinbefore, the sample having the largest absolute value is determined for every 12 samples of a subband channel. Line (a) of Fig. 24 shows the binary representation of a maximal sample $|S_{\max}|$. The first bit, designated SGN, is the sign bit and is '0' because it relates to the absolute value of $S_{\max}$. The samples are represented in two's complement notation. The sample comprises k 'zeros' followed by a '1'. The values of the other bits of the 24-bit digital number are not relevant and can be either '0' or '1'.

$|S_{\max}|$ is now multiplied by $2^{k}$ to produce the number shown in line (b) of Fig. 24. Subsequently $|S_{\max}| \cdot 2^{k}$ is compared with a digital number $DV_1$, equal to 010100001100000000000000, and a digital number $DV_2$, equal to 011001100000000000000000. If $|S_{\max}| \cdot 2^{k} < DV_1$, a specific constant p is taken to be 2. If $DV_1 \le |S_{\max}| \cdot 2^{k} < DV_2$, then p is taken to be 1. If $|S_{\max}| \cdot 2^{k} \ge DV_2$, then p = 0. The number k is limited to $0 \le k \le 20$. The scale factor is now determined by the numbers k and p in accordance with the following formula:

$SF = 3k + p$

Consequently, the maximum value for SF is 62. This means that the scale factors can be represented by 6-bit numbers, the six-bit number 111111 (which corresponds to the decimal number 63) not being used. In fact, the 6-bit binary numbers are not the scale factors but they are in a uniquely defined relationship with the actual scale factors, as will be set forth below. All the 12 samples S are now multiplied by a number which is related to the values for k and p. The 12 samples are each multiplied as follows:

$S' = S \times 2^{k} \times g(p)$

where the number g(p) has the following relation with p:

$g(p) = 1$ for p = 0

$g(p) = 1 + 2^{-2} + 2^{-8} + 2^{-10} + 2^{-16} + 2^{-18} + 2^{-23}$ for p = 1

$g(p) = 1 + 2^{-1} + 2^{-4} + 2^{-6} + 2^{-8} + 2^{-9} + 2^{-10} + 2^{-13} + 2^{-15} + 2^{-16} + 2^{-17} + 2^{-19} + 2^{-20}$ for p = 2.
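The determination of k, p and SF and the subsequent scaling of a block of 12 samples may be sketched as follows. This is an illustrative sketch under stated assumptions, not a definitive implementation: samples are modelled as floating-point fractions with magnitude below 1 rather than as 24-bit words, the function names are hypothetical, the comparisons at exactly DV1 and DV2 are taken as inclusive on the lower side, and an all-zero block is arbitrarily given k = 20, a case the text does not address.

```python
import math

# Thresholds DV1 and DV2, read as 24-bit two's complement fractions (sign bit first).
DV1 = int("010100001100000000000000", 2) / 2**23   # 0.630859375
DV2 = int("011001100000000000000000", 2) / 2**23   # 0.796875

# Exponents e of the terms 2**-e that are added to 1 to form g(p).
G_EXPONENTS = {
    0: (),
    1: (2, 8, 10, 16, 18, 23),
    2: (1, 4, 6, 8, 9, 10, 13, 15, 16, 17, 19, 20),
}


def g(p):
    """Transmitter-side multiplier g(p) for p = 0, 1 or 2."""
    return 1 + sum(2.0 ** -e for e in G_EXPONENTS[p])


def scale_factor(samples):
    """Return (k, p, SF) for one block of samples with |s| < 1."""
    s_max = max(abs(s) for s in samples)
    if s_max >= 1.0:
        raise ValueError("samples must have magnitude below 1")
    if s_max == 0.0:
        k = 20  # assumption of this sketch: treat silence as the smallest range
    else:
        # frexp gives s_max = m * 2**e with 0.5 <= m < 1, so -e counts the
        # leading zeros after the sign bit; k is limited to at most 20.
        k = min(-math.frexp(s_max)[1], 20)
    normalised = s_max * 2**k
    if normalised < DV1:
        p = 2
    elif normalised < DV2:
        p = 1
    else:
        p = 0
    return k, p, 3 * k + p


def scale_block(samples):
    """Return (SF, scaled samples S') for one block."""
    k, p, sf = scale_factor(samples)
    factor = 2**k * g(p)
    return sf, [s * factor for s in samples]
```

With these definitions g(1) and g(2) come out very close to the reciprocals of DV2 and DV1 respectively, which is why a block whose normalised maximum lies below a threshold can be multiplied by the corresponding g(p) without the result reaching 1.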

The parameter k specifies the number of 6 dB steps and the factors g(1) and g(2) are the closest approximations to steps of 2 dB. The samples S' thus scaled are now quantised to enable them to be represented by q-bit digital numbers in two's complement notation. In Fig. 25 this is illustrated for q = 3. The scaled samples S' have values between +1 and -1, see Fig. 25a. In the quantiser these samples must be represented by q bits, q corresponding to the allocation value for the relevant subband (channel). Since, as stated above, the q-bit digital number comprising only 'ones' is not used to represent a sample, the total interval from -1 to +1 should be divided over $2^{q} - 1$ smaller intervals. For this purpose the scaled samples S' are transformed into the samples S'' in accordance with the formula

$S'' = S'(1 - 2^{-q}) - 2^{-q}$.

The samples S'' are subsequently truncated at q bits, see Fig. 25c. Since the '111' representation is not permissible the sign bits are inverted, see Fig. 25d. The q (= 3)-bit numbers given in Fig. 25d are now inserted in the third frame portion FD3, see Fig. 2.

Samples S' which comply with -0.71 < S' < -0.14 are represented by the digital number '001'. This proceeds similarly for samples S' of larger values up to samples which comply with 0.71 < S' < 1 and which are represented by the digital number '110'. Consequently, the digital number '111' is not used.
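The quantisation steps described above (transformation to S'', truncation at q bits, inversion of the sign bit) may be sketched as follows. The sketch assumes that truncation of a two's complement fraction corresponds to rounding towards minus infinity, and it works on floating-point values rather than on 24-bit words; the function name is hypothetical.

```python
import math


def quantise(s_scaled, q):
    """Return the q-bit code (as a bit string) for a scaled sample S' in (-1, 1)."""
    if not -1.0 < s_scaled < 1.0:
        raise ValueError("scaled sample must lie strictly between -1 and +1")

    # Transformation S'' = S'(1 - 2**-q) - 2**-q.
    s2 = s_scaled * (1 - 2.0 ** -q) - 2.0 ** -q

    # Truncation at q bits: the least significant of the q bits weighs 2**-(q-1).
    # Dropping the lower bits of a two's complement number rounds downwards.
    step = 2.0 ** -(q - 1)
    level = math.floor(s2 / step)

    # q-bit two's complement pattern, then inversion of the sign bit.
    code = level & ((1 << q) - 1)
    code ^= 1 << (q - 1)
    return format(code, f"0{q}b")
```

For q = 3 this sketch yields '000' for samples close to -1 and '110' for samples close to +1, so that the code consisting only of 'ones' never occurs; for instance, quantise(-0.5, 3) gives '001' and quantise(0.9, 3) gives '110'.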

Dequantization at the receiving side is effected in a manner inverse to the quantization at the transmission side, see Fig. 26. This means that first the sign bits of the q-bit digital numbers are inverted to obtain the normal two's complement notation, see Fig. 26b.

Subsequently the samples S' are derived from the transformed samples S'' by means of the formula

$S' = (S'' + 2^{-q+1})(1 + 2^{-q} + 2^{-2q} + 2^{-3q} + 2^{-4q} + \ldots)$

(see Figs. 26c and 26d). The values S' thus obtained are now situated exactly within the original intervals in Fig. 25a. At the receiving side the samples S' are subsequently scaled to the original amplitudes by means of the transmitted information k, p which is related to the scale factors. Thus, at the receiving side a number g'(p) complies with:

$g'(p) = 1$ for p = 0

$g'(p) = 2^{-1} + 2^{-2} + 2^{-5} + 2^{-6}$ for p = 1

$g'(p) = 2^{-1} + 2^{-3} + 2^{-8} + 2^{-9}$ for p = 2.

Scaling to the original amplitudes is now effected using the following formula:

$S = S' \cdot 2^{-k} \cdot g'(p)$.
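The receiver-side steps described above may be sketched as follows. As an assumption of this sketch, the infinite series $1 + 2^{-q} + 2^{-2q} + \ldots$ is replaced by its closed form $1/(1 - 2^{-q})$, whereas a fixed-point receiver would presumably use a finite number of terms; the function names are hypothetical.

```python
# Receiver-side multipliers g'(p), built from the terms given in the text.
G_PRIME = {
    0: 1.0,
    1: 2**-1 + 2**-2 + 2**-5 + 2**-6,   # 0.796875
    2: 2**-1 + 2**-3 + 2**-8 + 2**-9,   # 0.630859375
}


def dequantise(code, q):
    """Recover the scaled sample S' from a q-bit code given as a bit string."""
    # Invert the sign bit to return to normal two's complement notation.
    value = int(code, 2) ^ (1 << (q - 1))
    if value >= 1 << (q - 1):
        value -= 1 << q                  # interpret as a signed integer
    s2 = value * 2.0 ** -(q - 1)         # S'' as a fraction

    # S' = (S'' + 2**(-q+1)) * (1 + 2**-q + 2**-2q + ...), series in closed form.
    return (s2 + 2.0 ** (-q + 1)) / (1 - 2.0 ** -q)


def rescale(s_scaled, k, p):
    """Scale S' back to the original amplitude: S = S' * 2**-k * g'(p)."""
    return s_scaled * 2.0 ** -k * G_PRIME[p]
```

For q = 3 the codes '000', '011' and '110' are mapped to -6/7, 0 and +6/7 respectively, i.e. to the centres of the corresponding intervals of width 2/7 between -1 and +1.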

In the two possible versions of a frame as described with reference to Figures 2 and 3 and Figures 2, 17 and 19 respectively, the third frame portion may not be filled entirely with information. This will occur more often and sooner as the algorithms for subband coding, i.e. the entire process of dividing the signal into subband signals and the subsequent quantization of the samples in the various subbands, are improved. In particular, this will enable the information to be transmitted with a smaller number of bits (average number per sample). The unused part of the third frame portion can then be utilized for transmitting additional information. In the first frame portion FD1 in Fig. 17 allowance has been made for this by means of the "future-use" bit b23. Normally, this bit is '0', as will be apparent from Fig. 18.

Additional Signal

If an additional signal has been inserted in the third frame portion FD3 of a frame, the future-use bit b23 in the first frame portion FD1, see Fig. 17, will be '1'. During reading of the first frame portion FD1 this makes it possible for the receiver to detect whether the frame contains additional information. The allocation information and the scale factors, see Fig. 23, inform the receiver that only the part of the third frame portion FD3, marked FD4 in Fig. 23, contains quantised samples of the subband signals. The remainder, marked FD5 in Fig. 23, now contains the additional information. The first bits in this frame portion FD5 are designated 'EXT INFO' or extension information. These bits indicate the type of additional information. The additional information may be, for example, an additional audio channel, for example for the transmission of a second stereo channel. Another possibility is to use these two additional audio channels to realise 'surround sound' together with the audio subband signals in the frame portion FD4. In that case the front-rear information required for surround sound may be included in the frame portion FD5. In the part marked FD6 the frame portion FD5 may again contain allocation information, scale factors and samples (in this order) and the sequence of the allocation words and the scale factors may then be similar to the sequence as described with reference to Figs. 2 and 3 and Figs. 2, 17 and 19.

In the case of 'surround sound' simple receivers may merely decode the stereo audio information in the frame portions FD2 and FD3, except for the frame portion FD5. More sophisticated receivers are then capable of reproducing the surround-sound information and for this purpose they also employ the information in the frame portion FD5.

The extension-info bits may also indicate that the information in the frame portion FD6 relates to text, for example in the form of ASCII characters. It may even be considered to insert video or picture information in the frame portion FD6, this information again being characterized by the extension-info bits.

It is to be noted that the invention is not limited to the embodiments shown herein. The invention also relates to those embodiments which differ from the embodiments shown herein with respect to features which are not relevant to the invention as defined in the Claims.